Tracking talking faces with shape and appearance models
Abstract
This paper presents a system that can recover and track the 3D speech movements of a speaker’s face for each image of a monocular sequence. To handle both the individual specificities of the speaker’s articulation and the complexity of the facial deformations during speech, speaker-specific articulated models of the face geometry and appearance are first built from real data. These face models are used for tracking: articulatory parameters are extracted for each image by an analysis-by-synthesis loop. The geometric model is linearly controlled by only seven articulatory parameters. Appearance is seen either as a classical texture map or through local appearance of a relevant subset of 3D points. We compare several appearance models: they are either constant or depend linearly on the articulatory parameters. We compare tracking results using these different appearance models with ground truth data not only in terms of recovery errors of the 3D geometry but also in terms of intelligibility enhancement provided by the movements.
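The analysis-by-synthesis idea with a linearly controlled geometric model can be illustrated with a minimal sketch. It is not the authors' implementation: the mean shape, the basis, the pinhole camera (`focal`, `depth`), and the function names are all hypothetical, and the image comparison is simplified to matching projected 2D point positions rather than texture or local appearance. Only the count of seven articulatory parameters comes from the abstract.

```python
import numpy as np

N_PARAMS = 7  # the abstract states seven articulatory parameters


def synthesize(mean_shape, basis, params):
    """Linear geometric model: flattened 3D points = mean + basis @ params."""
    return (mean_shape + basis @ params).reshape(-1, 3)


def project(points_3d, focal=800.0, depth=500.0):
    """Hypothetical pinhole camera; the face sits `depth` units in front of it."""
    z = points_3d[:, 2] + depth
    return focal * points_3d[:, :2] / z[:, None]


def track_frame(observed_2d, mean_shape, basis, init, n_iter=10, eps=1e-4):
    """Gauss-Newton loop: synthesize, compare with the observation, update."""
    params = np.asarray(init, dtype=float).copy()
    target = observed_2d.ravel()
    for _ in range(n_iter):
        predicted = project(synthesize(mean_shape, basis, params)).ravel()
        residual = predicted - target
        # Forward-difference Jacobian of the projection w.r.t. each parameter.
        jac = np.empty((residual.size, params.size))
        for k in range(params.size):
            step = np.zeros_like(params)
            step[k] = eps
            shifted = project(synthesize(mean_shape, basis, params + step)).ravel()
            jac[:, k] = (shifted - predicted) / eps
        delta, *_ = np.linalg.lstsq(jac, -residual, rcond=None)
        params += delta
    return params


# Synthetic stand-in for a speaker-specific model built from real data.
rng = np.random.default_rng(0)
n_points = 30
mean_shape = rng.normal(scale=50.0, size=3 * n_points)
basis = rng.normal(scale=5.0, size=(3 * n_points, N_PARAMS))
true_params = rng.normal(size=N_PARAMS)

observed = project(synthesize(mean_shape, basis, true_params))
estimated = track_frame(observed, mean_shape, basis, np.zeros(N_PARAMS))
print(np.abs(estimated - true_params).max())
```

In a sequence, the estimate for one frame would typically serve as `init` for the next. An appearance model that depends linearly on the articulatory parameters could be sketched the same way, with the residual computed on pixel values instead of point positions.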
Similar resources
Shape and appearance models of talking faces for model-based tracking
This article presents a system that can recover and track the 3D speech movements of a speaker’s face for each image of a monocular sequence. A speaker-specific face model is used for tracking: model parameters are extracted from each image by an analysis-by-synthesis loop. To handle both the individual specificities of the speaker’s articulation and the complexity of the facial deformations du...
Robust Parameterized Component Analysis Theory and Applications to 2D Facial Modeling
Principal Component Analysis (PCA) has been successfully applied to construct linear models of shape, graylevel, and motion. In particular, PCA has been widely used to model the variation in the appearance of people’s faces. We extend previous work on facial modeling for tracking faces in video sequences as they undergo significant changes due to facial expressions. Here we develop person-speci...
Learning to recognise talking faces
An approach for person identification is described based on spatio-temporal analysis of the talking face. A person is represented by a parametric model of the visible speech articulators and their temporal characteristics during speech production. The model consists of shape parameters, representing the lip contour and intensity parameters representing the grey level distribution in the mouth r...
From Dynamic Texture to Dynamic Shape and Appearance Models: An Overview
In modeling complex visual phenomena one can employ rich models that characterize the global statistics of images, or choose simple classes of models to represent the local statistics of a spatiotemporal segment, together with the partition of the data into such segments. Each segment could be characterized by certain statistical regularity properties in space and/or time. The former approach i...
Shape Invariant Recognition of Segmented Human Faces using Eigenfaces
This paper describes an efficient approach for face recognition as a two step process: 1) segmenting the face region from an image by using an appearance based model, 2) using eigenfaces for person identification for segmented face region. The efficiency lies not only in generation of appearance models which uses the explicit approach for shape and texture but also the combined use of the afore...
Journal title:
- Speech Communication
Volume 44
Publication date: 2004